Continual Learning via Local Module Composition

Neural Information Processing Systems

Modularity is a compelling solution to continual learning (CL), the problem of modeling sequences of related tasks. Learning and then composing modules to solve different tasks provides an abstraction to address the principal challenges of CL, including catastrophic forgetting, backward and forward transfer across tasks, and sub-linear model growth. We introduce local module composition (LMC), an approach to modular CL where each module is provided a local structural component that estimates a module's relevance to the input. Dynamic module composition is performed layer-wise based on local relevance scores. We demonstrate that agnosticity to task identities (IDs) arises from (local) structural learning that is module-specific, as opposed to the task- and/or model-specific structural learning of previous works, making LMC applicable to more CL settings.
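The abstract's key mechanism is layer-wise composition weighted by each module's local relevance estimate. Below is a minimal NumPy sketch of that idea, under assumptions not in the abstract: each module's structural component is modeled as a tiny linear autoencoder whose (negated) reconstruction error serves as the relevance score, and composition is a softmax-weighted sum of module outputs. The class and function names are illustrative, not the paper's.

```python
import numpy as np

rng = np.random.default_rng(0)

class Module:
    """One expert: a linear map plus a local structural component.

    The structural component here is a tiny linear autoencoder; its
    negated reconstruction error stands in for LMC's local relevance
    estimate (an assumption for illustration, not the paper's exact model).
    """
    def __init__(self, d_in, d_out, d_code=4):
        self.W = rng.normal(scale=0.1, size=(d_out, d_in))     # functional part
        self.enc = rng.normal(scale=0.1, size=(d_code, d_in))  # structural part

    def forward(self, x):
        return self.W @ x

    def relevance(self, x):
        recon = self.enc.T @ (self.enc @ x)
        return -np.sum((x - recon) ** 2)  # higher = input looks familiar

def compose_layer(modules, x):
    """Layer-wise composition: softmax over local relevance scores,
    then a weighted sum of module outputs. No task ID is consulted."""
    scores = np.array([m.relevance(x) for m in modules])
    w = np.exp(scores - scores.max())
    w /= w.sum()
    out = sum(wi * m.forward(x) for wi, m in zip(w, modules))
    return out, w
```

Because each relevance score is computed locally per module, adding a new module later does not require retraining a global, task-conditioned router.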


Modular Networks: Learning to Decompose Neural Computation

Neural Information Processing Systems

Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number of parameters with a relatively small increase in resources. We propose a training algorithm that flexibly chooses neural modules based on the data to be processed. Both the decomposition and modules are learned end-to-end. In contrast to existing approaches, training does not rely on regularization to enforce diversity in module use. We apply modular networks both to image recognition and language modeling tasks, where we achieve superior performance compared to several baselines. Introspection reveals that modules specialize in interpretable contexts.



empirical studies

Neural Information Processing Systems

Our approach enables efficient optimization and sharing across modules. The reviews are largely positive: R1 finds that we "motivate their work very well" and that the paper "is technical[ly] sound," R2 notes the "idea seems to be new," and R3 calls it a "very important problem." We will address the reviewers' comments as follows. Theoretical grounding: R2 argues the paper is not well grounded in neural network theory. R2 also asks "Why a dot product for the weighting?" Weighting itself implies multiplication, and R2 does not suggest an alternative weighting scheme. Comparison to the state of the art: R2 did not provide a reference on multi-task RL for us to compare against; how to adapt our method to multi-task RL is an interesting direction, but it is out of the scope of this paper. While R2 complains about our writing, the other reviewers all give positive feedback ("I liked to read the paper"). For the routing network, the inputs are the same as for the policy, including both states and the task embedding.



Reviews: Modular Networks: Learning to Decompose Neural Computation

Neural Information Processing Systems

The paper is concerned with conditional computation, an interesting topic still at an early stage of research and therefore one that merits further investigation. The paper proposes a latent-variable approach to constructing modular networks, modeling the choice of processing modules in a layer as a discrete latent variable. A modular network is composed of L modular layers, each comprising M modules and a controller. Each module is a function (a standard layer) f_i(x; \theta_i). The controller accepts the input, chooses K of the M modules to process it, and combines the chosen modules' outputs into the layer output. Modular layers can be stacked, or placed anywhere inside a standard network.
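The review's description of a modular layer (M modules f_i(x; θ_i), a controller picking K of them) can be sketched directly. Assumptions not in the source: the controller here is a simple linear scorer with deterministic top-K selection and the combined output is a plain sum, whereas the paper trains the controller and modules jointly through a latent-variable (EM-style) objective, which this sketch omits.

```python
import numpy as np

rng = np.random.default_rng(1)

class ModularLayer:
    """One modular layer: M candidate modules and a controller.

    Each module is a simple linear map f_i(x; theta_i); the controller
    is a linear scorer with top-K selection (an illustrative stand-in
    for the paper's latent-variable module choice).
    """
    def __init__(self, d, M=4, K=2):
        self.modules = [rng.normal(scale=0.1, size=(d, d)) for _ in range(M)]
        self.ctrl = rng.normal(scale=0.1, size=(M, d))  # controller weights
        self.K = K

    def forward(self, x):
        scores = self.ctrl @ x
        chosen = np.argsort(scores)[-self.K:]           # top-K modules
        out = sum(self.modules[i] @ x for i in chosen)  # combine their outputs
        return out, chosen
```

Since the layer maps d-dimensional inputs to d-dimensional outputs, such layers can be stacked or dropped into a standard network, as the review notes.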


Modular Neural Networks for Time Series Forecasting: Interpretability and Feature Selection using Attention

Su, Qiqi, Kloukinas, Christos, Garcez, Artur d'Avila

arXiv.org Artificial Intelligence

Multivariate time series have many applications, from healthcare and meteorology to life science. Although deep learning models have shown excellent predictive performance for time series, they have been criticised for being "black-boxes" or non-interpretable. This paper proposes a novel modular neural network model for multivariate time series prediction that is interpretable by construction. A recurrent neural network learns the temporal dependencies in the data while an attention-based feature selection component selects the most relevant features and suppresses redundant features used in the learning of the temporal dependencies. A modular deep network is trained from the selected features independently to show the users how features influence outcomes, making the model interpretable. Experimental results show that this approach can outperform state-of-the-art interpretable Neural Additive Models (NAM) and variations thereof in both regression and classification of time series tasks, achieving a predictive performance that is comparable to the top non-interpretable methods for time series, LSTM and XGBoost.
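The abstract's two-stage design (attention-based feature selection feeding per-feature modules whose contributions stay visible) can be illustrated with a small sketch. Everything concrete below is an assumption for illustration: the attention score is a simple learned-query product, and each feature module is an arbitrary callable; the paper's actual recurrent components are omitted.

```python
import numpy as np

rng = np.random.default_rng(2)

def attention_feature_weights(X, query):
    """Softmax attention weights over the F features of a (T, F) series.

    `query` stands in for learned attention parameters; weights near
    zero correspond to features the model suppresses as redundant.
    """
    scores = X.mean(axis=0) * query      # one score per feature
    w = np.exp(scores - scores.max())
    return w / w.sum()

def modular_predict(X, query, feature_models):
    """Each feature feeds its own small module; the prediction is the
    sum of weighted per-feature contributions, so each feature's
    influence on the outcome is directly inspectable."""
    w = attention_feature_weights(X, query)
    contribs = np.array([w[f] * feature_models[f](X[:, f])
                         for f in range(X.shape[1])])
    return contribs.sum(), contribs
```

The additive structure is what makes the model interpretable by construction: `contribs` decomposes the prediction feature by feature, in the spirit of NAM-style additive models the abstract compares against.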


Policy Stitching: Learning Transferable Robot Policies

Jian, Pingcheng, Lee, Easop, Bell, Zachary, Zavlanos, Michael M., Chen, Boyuan

arXiv.org Artificial Intelligence

Training robots with reinforcement learning (RL) typically involves heavy interactions with the environment, and the acquired skills are often sensitive to changes in task environments and robot kinematics. Transfer RL aims to leverage previous knowledge to accelerate learning of new tasks or new body configurations. However, existing methods struggle to generalize to novel robot-task combinations and scale to realistic tasks due to complex architecture design or strong regularization that limits the capacity of the learned policy. We propose Policy Stitching, a novel framework that facilitates robot transfer learning for novel combinations of robots and tasks. Our key idea is to apply modular policy design and align the latent representations between the modular interfaces. Our method allows direct stitching of robot and task modules trained separately to form a new policy for fast adaptation. Our simulated and real-world experiments on various 3D manipulation tasks demonstrate the superior zero-shot and few-shot transfer learning performance of our method. Our project website is at: http://generalroboticslab.com/PolicyStitching/ .
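The core idea of stitching separately trained modules through an aligned latent interface can be sketched as follows. Assumptions not in the abstract: the modules are plain linear maps, and latent alignment is approximated by projecting interface codes to the unit sphere; the paper's actual alignment mechanism and RL training are omitted.

```python
import numpy as np

rng = np.random.default_rng(3)

class TaskModule:
    """Task side: maps task observations to a code at the module interface."""
    def __init__(self, d_obs, d_lat):
        self.W = rng.normal(scale=0.1, size=(d_lat, d_obs))

    def encode(self, obs):
        z = self.W @ obs
        # Normalizing is an illustrative stand-in for the paper's
        # latent-representation alignment at the interface.
        return z / (np.linalg.norm(z) + 1e-8)

class RobotModule:
    """Robot side: maps the shared latent code to robot-specific actions."""
    def __init__(self, d_lat, d_act):
        self.W = rng.normal(scale=0.1, size=(d_act, d_lat))

    def act(self, z):
        return self.W @ z

def stitched_policy(task_m, robot_m, obs):
    """Compose modules trained separately: because both sides agree on
    the interface, a new robot-task pairing needs no joint retraining."""
    return robot_m.act(task_m.encode(obs))
```

Any task module can then be paired with any robot module sharing the same latent dimension, which is what enables the zero-shot recombination the abstract describes.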